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Abstract 

We present images of solar wind electron density structures at distances of 
1 A.U., extracted from the STEREO /Y{\-2 data. Collecting the images requires 
separating the Thomson-scattered signal from the other background/foreground 
sources that are 10^ times brighter. Using a combination of techniques, we are 
able to generate calibrated imaging data of the solar wind with sensitivity of a 
few xlO~^^B0, compared to the background signal of a few xlO~^^B0, using only 
the STEREO /UI-2 Level 1 data as input. These images reveal detailed spatial 
structure in CMEs and the solar wind at projected solar distances in excess of 
1 AU, at the instrumental motion-blur resolution limit of 1° - 3°. CME features 
visible in the newly reprocessed data from December 2008 include leading-edge 
pileup, interior voids, filamentary structure, and rear cusps. "Quiet" solar wind 
features include V shaped structures centered on the heliospheric current sheet, 
plasmoids, and "puffs" that correspond to the density fluctuations observed in- 
situ. We compare many of these structures with in-situ features detected near 
1 AU. The reprocessed data demonstrate that it is possible to perform detailed 
structural analyses of heliospheric features with visible light imagery, at distances 
from the Sun of at least 1 AU. 



Subject headings: Sun: coronal mass ejections (CMEs), solar-terrestrial relations — 
Solar System: interplanetary medium — methods: data analysis 



- 2 - 



Introduction 



The last decade has seen the emergence of a new class of white light imaging instru- 
ments, heliospheric imagers, that combine precise photometric wide-field imaging with ex- 
tre mely deep baffles. Re views of the cap a bilitie s of heliospheric imagers have been written 
bv iHarrison et all koO^ and bv lHowardI feoilh . 



Heliospheric imagers are in principle capable of detecting and tracking both large- and 
small-scale transients in the solar wind by collecting and imaging sunlight that is scat- 
tered directly off of free electrons in interplanetary space, just as coronagraphs capture 
the K-corona in the vicinity of the Sun. The first proof-of-concept observa tions of helio 



spher i c transients were dem onstrated with the Helios zodiacal light instrument (IRichter et al. 



19821 : iJackson et al.l Il985l ) and the first wide-field dedicated heliospheric imaging instru- 
ments were_the_SolarMass Ejection Imager (SMEI), launched in 2003 on the Coriolis space- 
craft (lEvles et al.l 120031) a nd the STEREO mission Heliospheric Imagers (His), launched 
in 2006 ( Eyles et al.l 2009 ). These instruments have proven to be ef fective for observing 



solar wind transient phenomena such as coronal mass ejections (e.g. [Howard et al. 



Harrison et aD boosi: IWebb et al.l 120091). corotating interaction regi 



2008 



Rouillard et al. 



2008 



20101 ) and "blobs" (e.g. ISheelev et al. 



2009 



j;ions y&.g. 



2006 



Sheelev et al. 



Tappin fc HowardI 2009a), and solar wind "p uffs" (e.g. Rouillard et al 



Sheeley and RouillardI 120101 ). 



Small, non-CME structure (puffs and blobs) have been observed in the solar w i nd in 



the outer corona (e .g. ISheeley et al.l 119971 ). by in-situ space craft fe.g. ISchwenn et al.lll990 



Phillips et al.lll995[ ). using interplane tary scintillation ( e.g. iRickett &: ColesI Il99ll ). and by 



the STEREO /Y{\-1 instruments (e.g. IClover et al.ll2010l ). Quantitative, detailed imaging of 



these structures has remained elusive at elongation angles above 40° (in the STEREO /H.l-2 
field of view). 

The greatest challenge in heliospheric imaging is signal extraction of the faint Thomson 
scattering signal from the far brighter foreground and background signals. Large CMEs have 
intensities (at 45° solar elongation) of order 10~^^ Bq, and the faintest remotely tracked solar 
wind features (puffs) have intensities of the order of a few x 10~^^ ( iRouillard et al.ll2010l : 
Tappin &: HowardI l2009af). (1 B^n is a solar bri ghtness unit ~ the surface brightness of the 



solar photosphere). iTappin &: HowardI ( l2009a( ) have demonstrated that careful analysis of 
differenced elongation- vs-time i mages ("J-maps" or "J-p lots") can detect features with bright- 
nesses of a few xlO~^^ Bq, and IRouillard et al.l ( 120101 ) has performed simlar J-plot analyses 
of small transient structures propagating with the ambient wind and becoming entrained in 
a CIR. Both groups used the structure of the J-plot imagery to reveal morphology of features 
with signal strength at or below the noise level imposed by the background star field in their 
processed data. To our knowledge no group has yet produced clear 2-D imagery of evolving 
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wind features in HI-2 at comparable or greater sensitivity. 

These faint signals are detected by STEREO /H.I against a background dominated by 
three principal sources: instrumental stray light, which forms a fixed pattern on the detector; 
zodiacal light (which we also refer to as "F corona") at the few x 10^^^ level that varies only 
slowly as the spacecraft orbits the Sun; the background starfield, which is extremely spiky 
(10^^^ Bq for a 10th m agnitude star in t he HI-2A instrument) and drifts with the orbital 



motion of the spacecraft ( lEyles et al.ll2003l ): and small artifacts, reminiscent of cosmic rays or 
dust, that are visible in many images despite on-board processing that removes most cosmic 
ray impacts. SMEI is also subjected to saturation from magnetospheric particles, auroral 
glow, and moonlight. The stray light pattern in STEREO /HI is weak compared to the F 
corona. 

In the present paper, we describe and demonstrate a technique by which the HI-2 data 
can be background-subtracted to reveal solar wind structures throughout the field with 
residual noise levels of ~ 3 x 10~^^ Bq in 1° square patches of image, within a factor of three 
of the photon counting noise level (which is ~ 10"^'' Bq against a background of 10~^^ Bq 
in the standard HI-2 data set, when averaged over a 1° square patch). 

We have reprocessed HI-2A data across a time period of several days surrounding a 
well-documented CME that occurred in December 2008. The reprocessed data have a noise 
floor lower than has been available in previous analyses, and reveal rich detail in the solar 
wind and CME structure, that has been previously inaccessible. 

Along with the CME we identify several solar wind transient features, including the CME 
itself, a forerunner, a filament, and some small transient s tructures that may c orrespond to 



puffs" and "blobs" observed by lRouillard et al.l (120101 ) and lSheeley et al.l (120091 ) respectively. 



Because the technique does not require difference subtraction, we are also able to identify 
cavity regions surrounding the CME event itself that have been undetectable with previous 
analytical techniques. Making J-plots from the resulting cleaned movie shows striking corre- 
spondence between the apparent arrival times of the observed heliospheric features at Earth, 
and the detection of similar density structures by the Wind spacecraft. 

We conclude with discussions of the limitations of image processing for extracting solar 
wind heliospheric image data, of physical implications of high quality imaging of transients 
in the solar wind, and planned work to exploit the newly available data sets. 
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Instrument and Event 



We co nsider the outer heliospheric imager (H I -2A) on board the STEREO-A ("ahead") 
spacecraft (lEyles et al.l |2009[ [Howard et aLl l2008t iKaiser et al.ll2008[ ). HI consists of two 
cameras, HI-1 normally observes across an elongation range of 4° to 24° and HI-2 from 19° 
to 89°, both centered on the line formed by the image-projected ecliptic plane. STEREO 
was launched so that its angular separation from the Sun-Earth line increased at a rate 
of ~22.5° per year. During the time period of interest (December 2008), STEREO-A was 
42° away from the Sun-Earth line and at a radial distance of 0.97 AU from the Sun. HI-2 
operates on a 2 hour cadence, summing many individual camera frames into 7,000 second 
"macro-exposures" on board the spacecraft. The macro-exposures are despiked on-board 
in the temporal direction to remove cosmic rays and similar artifacts before summing. The 
camera has an active image planes of 2048 x 2048 pixels but is summed 2x2 before downlink, 
so that reduced images are 1024 x 1024 pixels in size. 

We chose to focus on HI-2A alone rather than a joint study with HI-2A and HI-2B (on 
the "behind" spacecraft) because the starfield in HI-2B images is more difficult to isolate than 
from HI-2A, due to an apparent defocus seen in the HI-2B images. That defocus is smaller 
than the expected motion blur we describe (1°- 3°depending on feature speed), but large 
enough to require slightly different techniques for starfield subtraction than are described 
here. 

The event of primary interest was a CME that erupted from the Sun on 12 December 
2008 and reached the HI-2 field of view the following day. It is generally regarded as the 
first geoeffective CME observed duri ng th e STEREO mission and has been documented by 
Davis et al.l (120091 ) . iLiu et al.l (120 lOf ) and iLugaz et al.l ( 120101 ). Our observational coverage 
includes the time period from 11-21 December, which includes the pre- and post-CME solar 
wind in this region. We set out to extract and observe whatever structure we could from the 
event as it propagated outward. 



3. Processing 

We removed background from the HI-2 images in stages. Like other unpolarized coro- 
nagraphs such as SOHO /LASCO, the HI-2 images include a strong image component from 
dust (the zodiacal light or "F corona"), which cannot in principle be separated from the fixed, 
smooth bulk component of the desired Thomson-scattered ("K-corona") signal. Because the 
F corona is nearly constant, it can be separated from dynamic transients in the K coronal 
signal by tracking and subtracting the minimum brightness in any single pixel on the focal 
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plane over an interval long compared to the crossing time of the features of interest. This 
yields images of the "excess density" due to transient features. A major complication for 
HI-2 is the background starfield. Unlike coronagraph data, both the F and K coronal signals 
in the HI-2 field are faint compared to the bright stars in the starfield. Further, the starfield 
has different motion characteristics than the F corona because of spacecraft orbital motion, 
and is extremely spiky, stretching the limits of pattern identification and rectification. 

Because the Thomson signal is so faint, additional steps are required to remove second- 
order effects not visible in less challenging data, which requires two additional filtration 
steps. Noise sources are summarized in Table [TJ Two bright planets were visible in the 
field of view - Earth and Venus - and they saturated the detector, leaving top-to-bottom 
saturated streaks that damaged some of the later filtering steps. We cropped the field of 
view of our observation to eliminate them. 

We began our analysis with STEREO-snpplied Level 1 data, which are linearly cal- 
ibrated (no nonlinear calibration applied) in normalized digitizer counts per second (DN 
s^^). An example frame is given in Figure [H which has been square-root scaled to show the 
dominant features in the data: bright stars interspersed with the F corona at exposure levels 
of a few tens of DN s^^. The processing includes five major steps: stationary background 
removal; celestial background removal (including cross-image distortion measurement); resid- 
ual F corona removal; moving-feature filtration in the Fourier plane; and conversion back to 
focal plane coordinates. The K coronal signal is not visible at this brightness scale, but is of 
order 0.02 DN s~^ compared to the scale of 25 DN s~^. 

3.1. Stationary background removal 

The F corona and stray light signals are both approximately stationary on the image 
plane. We do not attempt to separate them, but remove both with the same image processing 
step. We identified the stationary background by simple stacking of the complete 11 day 
data set. We treated each pixel as a statistical population of brightnesses, and examined the 

Table 1: Characteristics of background and noise sources in the STEREO /B.l-2 data 



Background source 


Motion 


Amplitude (DN s^^) 


Stray light 


stationary 


up to 30 at bright edge 


F corona 


approx. stationary 


up to 30 


Starfield 


moves l°/day 


up to 20 


Galaxy 


moves l°/day 


up to 20 
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HI-2A2008 Dec15 12:09 (Levi) 

I I , I ■ ■ . I _ 




Fig. 1. — Unprocessed STEREO /UI-2 Level 1 image at 0-25 DN s~^ (square root scaled to 
increase dynamic range), obtained on 15 December 2008 at 12:09 UT (2 hour accumulation). 
The Sun is to the right, but well outside the image, and the coordinates are in units of image 
pixels. Dominant features include the F corona, starfield, and CCD artifacts including the 
two planets Earth (left) and Venus (right). The planets saturate the CCD causing "bleeding" 
of charge in the vertical direction and defining the width of the usable field of view for our 
analysis. The K coronal signal of interest has an amplitude of 0.02 DN s~^, 1000 times 
fainter. 
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faintest brightness value from each pixel. Data dropouts are set by the STEREO pipeline 
to have negative numbers. For the sorting, we treated these numbers as large, so that the 
faintest valid values of each pixel could be considered. The faintest and brightest value 
images for the data set spanning 11-21 December 2008 in HI-2a are shown in Figure [2J The 
faintest-value image contains primarily F corona and stray light. The brightest- value image 
on the right of Figure [2] clearly shows the starfield as a set of streaks: as each bright star 
passes across the field of view over the data set, it sets the brightest value of that pixel. The 
optics distortion function is visible as a variation in the characteristic length and angle of 
the streaks across the field. 

Measuring the F corona and stray light requires slightly more care than simply extracting 
the minimum value. Three important artifacts remain: overexposure streaks from planets 
and other bright features; residual streaking from the starfield; and statistical noise-induced 
errors from the F corona. The first two can be combatted with in-image filtering; the 
latter is more problematic. Figure [3] shows the minimum value image, unsharp-masked to 
highlight these remaining structures that are present at the ±0.1 DN s~^ level. The exposure 
artifacts can be removed by using a low-percentile image (we chose the 5th percentile in pixel 
brightness) at some cost in additional starfield artifacts. The starfield can be further reduced 
with spatial median filtration, and we use a 5 x5 median filter on our fixed background image. 
This technique is similar to that used used by the STEREO /H.l-2 official pipeline, which 
involves taking a per-pixel average of the lowest quartile of values in an 11 day window. 

The speckling toward the right in Figure [3] is more problematic than the other two 
error sources in the fixed pattern image, and we attribute it to variations in the noise 
level of different detector bins on the CCD, coupled with the skewed sample imposed by 
a minimum-value measurements. The Level 1 data are linearly flat-field corrected and blur 
compensated for exposure during readout, but it is of course not possible to eliminate all 
detection noise, and the data value at each pixel is the sum of the image plus samples of 
independent random variables from photon counting and the inherent flxed-pattern noise 
of the CCD. The minimum- value operation across an ensemble of raw images necessarily 
produces a skewed sample of these detection noise sources. The two sources introduce two 
visually distinct signatures. The flrst is due to the flxed pattern noise of the CCD and appears 
as "speckle" in the brightest portion of the image (center-right). The second is due to skewed 
sampling of the photon noise from the F corona itself. It yields both a brightness-dependent 
level of "snow" in Figure |3] and a systematic underestimation of the background brightness. 
It may in principle be possible to correct for this underestimation, by characterizing the 
independent statistics of the flxed pattern noise and photon counting noise. 

A nice compromise between the rabbit hole of independent statistical treatment of noise 
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2008 Dec 11-21 min value 2008 Dec 11-21 max value 




Fig. 2. — Faintest- and brightest- value images from the STEREO /Y{\-2A Level 1 data span- 
ning 2008 Dec 11-21. (Left) Faintest-value image shows F corona and stray light; (Right) 
brightest-value image shows moving starfield, spikes from near-spacecraft dust or cosmic 
rays, and marked data dropouts. The faint image was further processed to reduce starfield, 
and kept as the F corona model for the data set. Both images have been scaled to 0-50 
normalized HI-2 DN and square root scaled to display faint features. 

min vafue, unsharp masked 




Fig. 3. — Minimum value image from Fig. [2] unsharp masked with a 9x9 pixel boxcar to 
show detail. Residual streaks from the starfield are visible throughout, as are artifacts from 
planetary overexposure and speckles from statistical variation in the F corona. 
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random variables in each pixel, and simply ignoring the problem, is skewed median filtering 
of the background. Selecting the 30th percentile value from each 5x5 pixel neighborhood 
removes much of the local brightness from the faint starfield at the cost of increasing F coronal 
leakage slightly from the skewed noise sample. Figure H] shows the quality of the fit: the non- 
stellar fixed pattern is reduced by a factor of approximately 300 by background subtraction 
with the skewed-median background i mage. As with othe r broadband unpolarized Thomson 
scattering data (e.g. from LASCO C3 iMorrill et al.ll2006l ) it is not possible to separate the F 
coronal (zodiacal light) signal from the fixed portion of the K coronal (Thomson scattered) 
signal. The subtracted image also demonstrates why there is stellar structure in the minimum 
image. At the faint end of the stellar brightness distribution, there are stars in most pixels 
with only isolated darker ones. 



3.2. Celestial background removal 

Removing the stationary and quasi stationary background components leaves the celes- 
tial background of stars, galaxies, and other features that remain fixed on the celestial sphere 
rather than the image plane. The starfield is the most challenging noise source to remove, 
because individual stars take the forms of spikes on the image, with high gradients. Subtrac- 
tion of a model starfield requires deeply sub-pixel alignment on the image. The alignment 
is made more challenging by the fact that the starfield moves on the image plane between 
exposures as the spacecraft orbits the Sun, and by the fact that the optics impose a nontriv- 
ial projection function on the wide-angle field of view. We generated a model starfield from 
the data themselves, fitting the distortion function D between images as the composition of 
a parameterized deprojection function from the image plane to the celestial sphere. This is 
followed by a time- dependent orbital transformation, and reprojection onto the focal plane: 

D,^{x,y,M) = {P,^o S,^m o P-^) {x,y) (1) 

where Pg. is the projection function from celestial coordinates to image coordinates and is 
invariant under nominal operations (though it includes a rotation to orbital coordinates that 
could in principle change if the instrument pointing changed from the nominal ecliptic-plane 
orientation), Sq^ is the orbital-shift function in ecliptic coordinates and contains all the time 
dependence in D, and the qi are the fit parameters. As a check on the projection correctness 
we fitted the orbital speed and spacecraft attitude as part of the parameterization. The 
fit was accomplished by measuring the in-plane coordinate shift between the starfields in a 
collection of points from two separate images collected five days apart, and adjusting both P 
and 5(2. 5d) to minimize the RMS error between the modeled and measured shifts. Because 
5* is approximately linear in time (treating the orbit as circular), and P is invariant across 
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exposures, this yielded a coordinate transformation capable of transforming the starfield 
between the focal planes of any two exposures at known times in the same general part of 
the STEREO orbit. 



3.3. Measuring cross-image distortion 

Fitting the functions in Equation ([1]) requires measuring D{x,y). We measured the func- 
tion with a regular grid of tiepoints that were used as the basis of correlative fits between two 
images taken 5 days apart. In our first attempt, we centroided bright stars (on the irregular 
grid formed by the stars themselves); but the centroid identifications had positional noise at 
about the 0.2 pixel RMS level, with occasional outliers (in bright stars) of about 0.5 pixel 
displacement. We attribute this noise source to subpixel variations i n detector sen s itivity , 



coupled to the strong gradients in the stellar images, as described by I Jackson et al.l (120041 ). 
Switching to patch correlation between corresponding patches in the earlier and later images 
yielded much lower noise, which we attribute to dithering of the subpixel gradients across all 
the stars in each patch (typically dozens to hundreds of faint stars), thereby beating down 
this noise source via multiple sampling. A further noise source, which is important even to 
the correlative fits near the edges of the image, is variation in the instrument PSF across the 
image field of view. Near the edges of the field of view, stars become visible elongated and 
the correlative fits appear correspondingly noisier. PSF effects are not important near the 
center of the image plane. 

We arrived at a square correlated patch of 31x31 pixels in size, large enough to yield 
a large central correlation spike at the proper alignment and null the correlation signal 
for spurious alignments of different bright stars between the two images. Typical patches 
contained dozens of individually resolved stars in addition to the unresolved faint starfield. 
To better equalize the correlation contribution between bright and faint stars, we took the 
square root of each image patch. Typical correlation coefficients with 1 pixel offset were 0.4- 
0.6, though a few coincidental missed fits yielded local maximum correlation coefficients as 
high as 0.80 for misaligned starfields. For the proper subpixel displacement typical correlation 
coefficients were above 0.995. The patches were chosen on a regular grid, and missed fits 
were rejected. In general, fits near the edge of the field of view were missed, along with fits 
that included a planetary saturation spike in one or the other image. 

The initial guess offset, which we used to find the corresponding patch on the later image 
and seed the correlation fit, was formed based on a horizontal offset of 72 pixels in 5 days. We 
calculated the correlation coefficient between the two images 5 days apart on a 15x15 grid of 
integer pixel offsets around the horizontal offset location, and the subpixel fit was seeded with 
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the be st offset amonK t he trial offsets. Further ahgnment was fit using a simplex/amoeba 



fitter (iPress et aLlll989l ) with initial simplex radius of 0.75 pixels, centered on the best trial 
alignment. Patches were rejected if either the original or the shifted patch, with margins, 
intersected the edge of the telescope image plane or contained even one saturated pixel, or 
if the final best-fit correlation coefficient between the patches in the earlier and later frame 
was below 0.99. Figure [5] shows the horizontal and vertical components of the measured 
D{x, y, 2.5d), after removal of the base horizontal 72-pixel offset. 

The Dqi distortion function was assembled from parameterized components as in Equa- 
tion ([1]). The amount of displacement in S was a parameter of the fit, to verify that the 
spacecraft's orbital speed was reproduced by the fitter. The Pqi was a perspective projection 



of the celestial sphere with variable camera angle an d ij, par a meter (dist ance from pro 



ecting 



point to the center of the sphere) , as described by ISnyderl (Il987l ) and iBrown et al.l ( 120091 ) . 
We hoped to use the starfield, as did Brown et al., but to characterize the frame-to- frame 
offsets to better than their reported 1 pixel precision by refining the family of coordinate 
transformations used to relate celestial and STEREO focal-plane coordinates. 

The initial guess P was a stereographic projection of the celestial sphere (i.e. /i = 1) and 
the initial guess S was based on a 1° d^^ rotation about an axis parallel to the vertical axis 
of the image. The qi were iterated through an amoeba fitting process to find the best overall 
D using the sum-of-squares distance between the two sets of tiepoint pixel coordinates after 
transformation to the central time between the two images. 

Initial free parameters were: pointing of the projection center relative to image center 
(X and Y), roll and "B" (out-of-plane) angle of the projective plane relative to the ecliptic 
pole, orbital rotation 6, and projection distortion parameter /z. The roll and B angles, 
in particular, were treated as instrumental constants across the dataset. This is a valid 
simplification for a single few-day data set, though for a more general pipeline process the 
two angles should be allowed to change according to spacecraft orbital phase relative to the 
celestial nodes of the orbit. 

The initial fitting process led to a 6-parameter coordinate transformation with a residual 
RMS error of 0.35 pixel between the measured and modeled distortion parameters at the 
kept tiepoints, compared to the initial error (depicted in Figure [5]) of 3.6 pixels. An order 
of magnitude improvement isn't bad, but more improvement was possible. The residuals 
contained obvious errors in a quadrant pattern for vertical displacement and in a vertical 
striation pattern for horizontal displacement, and each was treated with ad- hoc perturbation 
functions of known pattern and variable amplitude, adding two more parameters to the fit. 
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The horizontal displacement ad hoc distortion was 



x^ = x + /?sin — ^ cos 7T trio (2) 



2 512 y V2 512 
and the vertical displacement ad hoc distortion was 



y' = y + a siv? 



512 



where a and /3 are the fitted amplitude coefficients and x and y are pixel index coordinates 
in the original image plane before any other coordinate transforms were applied, i.e. 

^ final ~ ^G-param ° fadhoc{x,y). (4) 

With the a and (3 parameters, a total of eight fit parameters were fitted to the measured 
displacements of each patch. The ad hoc functions had the correct performance for the central 
region of the image plane, but diverged slightly in the periphery of the valid images. For 
the purpose of this exploratory study, we simply discarded the region of poorer convergence, 
limiting the study to a 400x750 pixel region at the center of the image plane. Limiting 
the field of view is a good choice as it limits other edge effects such as vignetting and PSF 
variation at the periphery of the image plane, and also allows us to avoid the two top-to- 
bottom saturated regions in this data set (one from Earth and one from Venus). 

After iterating the distortion fit with the two ad-hoc correction functions, the distortion 
residual error was 0.09 pixel RMS in the target region. The residuals are shown in Figure 
ini Although this represents a large improvement in characterization of the starfield motion, 
with better or more complete selection of final correction functions there is still room for 
further improvement, as the residuals still contain visually identifiable patterns. 

The advantage of fitting the pixel displacement function as in Equation ([1]) is that new 
displacements can be computed using only the time difference between any two images in 
the same data set. We used the fitted P{x, y) and a computed S'(At) to warp every HI-2A 
image from 11 Dec 2008 at 00:09:21 through 21 Dec 2008 at 22:09:21 to a central time of 
14 Dec 2008 at 00:09:21, to fix the starfield in the datacube. That fixed starfield was then 
used to calculate the median value from every pixel, which formed a background starfield 
image that could be subtracted from the cube. Because of the high gradients around the 
bright stars, the median subtracted images still contained paired stellar images due to the 
subpixel misalignments. These took the form of dual (positive-going and negative-going) 
spikes around each bright star on opposite sides of the PSF core (2-3 pixels apart) and, 
near the periphery of the image where the PSF variation becomes important, triple spikes 
(two positive-going and one negative-going, or vice versa) around each bright star due to the 
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changing width of the PSF. We ehminated these artifacts using a 7x7 pixel running median 
filter - each pixel was replaced by the median of the values in a 7x7 square around itself. 
The 7x7 median filter effectively blurs the images by 0.5°, well within the expected motion 
blur from 300 km/s solar wind, which is of order 2° at 0.5 AU distance from the observer. 
The median-filtered images were then given similar treatment to the F corona subtraction 
step above. We identified the distribution of brightness values in each pixel, selected the 
10th percentile as a background value, and subtracted it from each distorted image. The 
results of these steps are given in Figure [71 

After starfield removal, the remaining visual features have an amplitude of approxi- 
mately 0.1 DN s~^. Near the center of the data set, the starfield null is good and essentially 
no structured stellar artifacts are seen. Nearer the extremities in either space or time, the 
starfield begins to "punch through" due to residual errors in the fitting function and to PSF 
variation across the original image field. However, the background level is no longer domi- 
nated by stars but by second-order effects due to the motion of the F corona during the data 
collection period. 

Even near the center of the image, the starfield remains faintly visible in the processed 
data as randomly fluctuating light/dark spots at the location of the pre-subtraction star. 
The fluctuations have no discernible (to us) spatial or temporal signature that could be 
exploited to reduce this noise source further, and they do not have the characteristic two- 
or three-peak structure characteristic of misaligned stars. We attribute the fluctuations to 
slight variations in the effective gain of the de tector depend i ng on the phase of the stellar 



image relative to the pixel grid as described by I Jackson et al.l (120041 ). This is the same effect 
that forced us to use patch correlation rather than direct centroiding of stars to measure the 
distortion function. 

Throughout the fitting and analysis process, careful attention to resampling is critically 
important because starfields stress the resampling operation and the usual interpolative 
shortcuts will spoil the data. When the starfield is "frozen" by resampling with bilinear 
interpolation (as is common in the heliophysics imaging community), small ripples are visible 
in the resulting starfield movie, corresponding to small motions of the stellar centroids of 
up to 0.5 pixel in the resampled data and peak amplitude fluctuations of tens of percent. 
In that case the starfield nulling operation is completely spoiled. To eliminate th is type of 



artifac t, we used spatially variable Jacobian-optimized resampling as described by iDeForest 



(120041 ). 



As an interesting aside, small, faint objects are easily visible in movies of the frozen 
starfield. When visually inspecting the frozen-starfield movie, we noticed a small moving 
object that proved to be Uranus (Figure IHl). 
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3.4. Residual F corona removal 

After removal of the bulk starfield and fixed F corona, we have reduced the dynamic 
range of the artifacts from ~30DN to ~0.1DN s~^, a factor of 300 improvemnt. At this 
level, second order effects in the F corona come into play, seen as the large scale variation in 
brightness in Figure [71 The residual brightness enters because the F corona is not completely 
time-independent as assumed in §3.11 We removed the brightness variation by making a 3rd 
order fit to the time varying brightness of each pixel across the full 11 day observing window, 
and subtracting that 3rd order fit from the brightness of each pixel. The automated fit didn't 
necessarily find the correct offset to preserve the zero point, so after subtracting the 3rd order 
fit, we subtracted from each pixel the minimum value of the 5x5 pixel median centered on 
that pixel, over the whole 12 day run. This process yielded relatively clean images of the K 
coronal component of the images, but some stellar artifacts remained visible. In particular, 
although the spatial median filter in §3.21 removed the large spikes due to stellar images, 
stars are preserved by two effects: 

1. the spikes offset the distribution of values in the remaining pixels in each 5x5 pixel 
region, so that the median does change slightly when the distribution of values includes 
a spike (though not as much as does the mean); 

2. the instrument point-spread function is not perfect, so that around each bright star is 
a small halo of points that are brighter than deep space by a value comparable to the 
signal for which we are looking. 

Both effects give rise to bright artifacts that are stretched in the direction of stellar motion 
on the detector, because they influence the signal in the fixed-focal-plane segment in §3.H 
where the stars are in motion. These halos are particularly visible in Figure [7| as horizontal 
bright artifacts. They are difficult to remove with single-image or pairwise image processing, 
because they have approximately the same signal strength and size as features of interest. 

3.5. Fourier moving-object filter 

We removed the remaining celestial-frame fixed artifacts with a Fourier filtering process. 
The entire 12 day image set was Fourier transformed in 3-D to kx,ky,u) space, and filtered 
with a moving-object filter that rejected all signal associated with wave speed slower than 
two image pixels per frame and all fixed signal in the current (celestial) coordinate system. 
The orbital motion of the starfield in the FOV is approximately 1.2 pixels per frame. For 
HI-2A, this motion is in the opposite direction to the solar wind propagation in the field of 
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view, so our two pixels per frame limit corresponds to less than one pixel per frame in the 
original data. 

To perform the filtering, we kept all Fourier energy inside in a cone with slope of 2 
pixels/frame near the uo axis in the celestial-frame resampled data, and rejected Fourier 
energy outside the cone. To prevent ringing, we applied a Hanning-style cosine rolloff with 
width 1/3 of the base filter width, centered on the cutoff frequency. This process makes use 
of the fact that the solar wind moves in the field of view, while the artifacts are either fixed 
or slow-moving, and has been used in a solar context primarily in the opposite sense - to 



remove P mode oscillat ions from solar surface magnetogram sequences (iHagenaar &: Shine 



20051 ; iLamb et al.ll2008[ ). Fast moving objects in magnetogram sequences are typically P 
modes that have high phase speed, and are rejected, while slow moving objects are the 
desired signal. Here, the sense is reversed, and we preserve moving solar wind features while 
rejecting the fixed pattern of the starfield. 

In addition to slow moving components, we rejected the fixed (w = 0) component of the 
Fourier image, which eliminates both the mean brightness and pattern background. This loss 
was repaired post facto by identifying the "zero point" of each pixel to be the minimum value 
of a 25x25x5 voxel average centered on each voxel in the 3-D {x,y,t) data space. When 
the zero point is subtracted, the remaining signal contains only the brightness component 
contained in moving features. The result is demonstrated in Figure [9l which shows the same 
three frames (out of 121) as in Figure [TJ after full processing in celestial coordinates. 



3.6. Conversion to focal plane coordinates 

After processing, we resampled the filtered images back into focal plane coordinates 
to yield a dataset in the original HI-2 coordinate system, with the background removed. 
We also applied the preliminary HI-2 calibration of 1.1 x 10~^^ Bq DN~^ s (C. Eyles, priv. 
comm.). The final result is shown in Figure [TU] and, in the digital edition of this article, in 
the attached movie file. 



4. Results 

Figure [10] shows extracted still frames from the data set and, in the digital edition of 
this article, a movie of the entire data set in original observing coordinates. The images 
are cropped to avoid both Venus and the Earth, and therefore show only a section of the 
HI-2 field of view, spanning about 25°x60°. The FOV moves gradually to the right as time 
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increases, because the cropping, starfield processing, and Fourier filtering were carried out 
in celestial coordinates before being transformed back into observing coordinates. 

The brightness scale of Figure [TUlis 0—2.5 x 10^^^ B©; individual features can be observed 
at brightnesses of ~ 2 x 10"^'' Bq. A great deal of structure appears in the sequence, and 
we have identified a number of features that can be isolated from the background solar wind 
activity. The level of variance in the solar wind is itself remarkable. 



HI-2 observations of specific classes of moving features (jSheeley et al.ll2008al ) have re- 
vealed variability in this altitude range, but to our knowledge this is the first algorithm 
capable of fixed computed-background subtraction to reduce image contamination below the 
brightness range of fine-scale features in the solar wind and enabling a direct view of evolv- 
ing structures as they propagate. More detailed scientific analysis will appear in a following 
paper; here, we confine ourselves to a first-look examination of the data. 

The CME itself first appears in the image dated 2008-12-15 at 00:09:21 in Figure [10] 
but a number of distinct features in the solar wind can be identified and tracked prior to its 
appearance. They appear across the top row and we have labeled them "Solar wind puffs". 
The second and third row shows the passage of the CME and a number of accompanying 
features that can also be identified and tracked, including "remnant loops" that may be 
indicative of flux rope structure and that surround an empty cavity or void. The structure 
slightly ahead of the CME in the second and third fra mes of the second row (2008-12- 
15T06:09:21 and T12:09:21) may be a CME forerunner J Jackson fc Hildneil EgTsl ) and the 
bright structure toward the south of the CME across the third row may be a prominence 
or streamer blowout. We have also identified features fiowing in the wake of the CME, two 
different types have been labeled "Plasmoid at CME cusp" and "CME trailing cusp" in the 
bottom two rows. The former is aligned near the center of the CME and appears to be 
either a related sympathetic eruption or part of the original magnetic structure comprising 
the pre-launch coronal field. The latter has a concave-outward V shape and straddles the 
solar equatorial region, meaning that it may be associated with the heliospheric current 
sheet. 

Behind the CME front in Figure [TU] are two clear, dark void s. Voids behind CMEs have 
been detected remotely via r adio interplanetary scintillation (ITappin et al.l Il983l : iTappin 
19871 : ITappin fc HowardI |2010| ) but have not been directly imaged before at such high elon- 
gations, because heliospheric imaging studies have either used running difference images, 
thereby confusing dark regions with bright regions that were present in the previous image, 
or have not achieved the level of sensitivity required to observe the voids and trailing faint 
structure. We tentatively identify the voids with the cavity in the the classical three-part 
CME often observed in coronagraphs ( jSime et al.lll984j ) and also with the region of reduced 
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density often observ ed accompanying (and sometimes ide ntified as a signature of) in-situ 



magnetic clouds (e.g. IWu &: Leppine]|2002[ IWei et al.ll2003[ ). When the feature impacted the 



Wind spacecraft a void was detected in-situ (see below). 

Figure [11] shows the data sequence in the form of an elongation-time "J-plot", produced 
by selecting a position angle slice across an elongatio n-time-intensity data cube, and utilized 
widely by the HI community (e.g. iDavies et al.ll2009[ ). A number of position angle slices have 
been chosen, measured relative to the ecliptic plane. Many of the features are labeled where 
they correspond with those in the movie frames. 

Figure [12] shows a J-plot for the ecliptic plane (+5° relative to STEREO-A heliographic 
coordinates during this time of year) built from the processed data sequence. The Earth 
is indicated as a dashed horizontal line near 70° elongation. This allows direct comparison 
between the HI-2 features and in-situ density data, and the density measured by Wind/SWE 
for the same time period is also shown above. Most of the features that we can track in 
the HI-2 dataset at this heliospheric latitude also appear in the in-situ data. Several labeled 
features from Figure [TU] are indicated in Figure [T2] and extended to the Earth with a dashed 
line. A few features (e.g. the second puff) do not appear or cannot be traced all the way 
to Earth (e.g. P4), either because they miss the Earth's heliospheric longitude are weak 
against the background or other brighter features. Three-dimensional analysis is planned 
using combined background-subtracted data from both of the STEREO spacecraft and from 
other auxiliary data sets. 



5. Discussion 

We have demonstrated extraction of a quantitative solar wind imaging signal from a HI- 
2 image sequence containing 3-4 orders of magnitude brighter background than the desired 
data. We are able to approach, but not achieve, the photon counting noise limit in the 
STEREO /m-2 instrument. 

The limiting factors in the final image quality of our data set are (1) stellar background 
"print-through", which we attribute primarily to subpixel variations in CCD sensitivity cou- 
pled to the relative phase of the pixel grid and the star's location; and (2) motion blur from 
the standard two hour accumulation time for HI-2, with corresponding blur of up to a few 



degrees with typical feature speed ranges (e.g. [Howard fc Simnettll2008l ). The stellar back- 
ground print-through dominates over photon counting noise in the feature size range that is 
observable in our data set, by a factor of about 3. 



Because we use the image stream itself as a background source, exploiting differences 
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in behavior and spatial distribution to distinguish desired data from the background, it 
is not possible to extract absolute brightnesses - only absolute "excess brightnesses" of 
moving features, as is typical for other unpolarized Thomson-scattering detectors such as 
iS'Oi/O/LASCO (iBrueckner et al.lll995[ ). Further, the final filtering step makes use of the 
fact that the solar wind apparent speed is fast compared to the angular orbital speed of the 
spacecraft, and therefore the processd data are not suited to imaging of features in the solar 
wind with little apparent motion, such as corotating interaction regions. 

Further improvement of this type of data would require (1) shorter effective exposure 
times or better motion compensation; (2) a way to minimize stellar print-through; and (3) 
an absolute measurement of the celestial background independent of the desired Thomson- 
scattered signal. Happily, all three are technically feasible: (1) may be accomplished with 
a larger instrument or with higher downlinked data volume; (2) may be accomplished with 
higher data volume by either defocusing the optics or dithering the exposure on the CCD 
between individual camera exposured during image accumulation; and (3) polarization can 
be used to separate the Thomson scattered signal from the background signal. Polarization, 
in particular, is important because the stars and galaxy, which are far more challenging than 
the F corona to remove, are unpolarized in broadband visible lig ht, and the slight F coronal 
polarization follows established models of particulate scattering (ILeinert et al.lll997f ). 



We are able to reveal absolute excess brightness of solar wind and CME structures with 
a S/N ratio of up to 30 in 1.5° patches of image, and demonstrate the presence of "voids" 
and complex structure in the wind stream more than 60° (or ~0.85 AU) from the Sun. 
Many features that are visible, including "V" shaped structures surrounding the heliospheric 
current sheet near the plane of the ecliptic, are to be expected from the wind speed gradient 
but have not been observed directly this far from the Sun. 

While the presence of structure in the solar wind has been known since the first in-situ 
measurements of particle density, and some wind transients have been detected with the 
STEREO /HI-2 imagers, to our knowledge this is the first report of quantitative white light 
imaging of detailed solar wind structure so far from the Sun. That is not surprising given 
the challenges inherent in separating the bright starfield from the faint Thomson signal, 
which requires both deep subpixel understanding of the instrument's projection function 
and careful attention to the mechanics of data resampling. We were aided in b oth of these 
goals by extensive use of Perl Data Language (IGlazebrook fc Economoul 119971 ). a scientific 
computing package that includes a framework for pa rameterizing ari d combining coordinate 
transformations, and for optimized data resampling ( iDeForestI 120041 ) . 



Efforts currently in progress, to be followed up in future reports, include three-dimensional 
reconstruction of feature complexes using dual spacecraft views and image analysis (e.g. 
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Howard! 1201 ll ). direct association of CME structure at high elongation with corresponding 
structure at the Sun via continuous tracking from launch to 1 AU; and a more detailed 
analysis of the small scale features revealed in the reprocessed data. 

The analysis technique we present here is based on batch processing of images and is 
limited to a few days per batch, partly by practical constraints of computer RAM and partly 
by motion of the starfield (which induces a trade-off between length of run and field of view); 
however, an open-ended processing pipeline has been prototyped, and works by merging the 
results of multiple batch-processed data sets. That technique and its results will be presented 
in an article to follow this one. 
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2008 Dec 15 (F corona removed) 




Fig. 4. — Background-subtracted version of Figure [H using the background derived via the 
5th percentile image of an 11 day dataset, with skewed-median smoothing as described in 
the text. The dynamic range is 25 x more sensitive than Figure [H The fixed brightness 
pattern containing the F corona has been reduced by a factor of approximately 300 but is 
still faintly visible at right. Many individual stars greatly exceed the dynamic range of this 
plot. 
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Fig. 5. — Initial measured nonlinear distortion function for STEREO /HI-2A shows measured 
horizontal and vertical distortions as a function of position over a 5 day interval in Dec 2008 
in the HI-2A field of view. Kept correlation patch centers are marked and form the data 
comparison points for the subsequent fit of the coordinate transformation. For display, the 
images are interpolated between patch centers using an inverse square distance-weighted 
average. 



Distortion residuals {hor'a.) Distortion residuals (vert,) 




X (pixels) X (pixels) 

Fig. 6. — Residual distortion error after starfield distortion fitting is 0.09 pixel RMS in the 
central rectangle (compare to Figure [5]). Locations where the distortion was measured are 
marked. 
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Fig. 7. — Freezing in the celestial coordinate system and starfield subtraction are demon- 
strated in these three images from the beginning (left), middle (middle), and end (right) of 
the dataset. Note factor-of-50 difference in the brightness scale between top (frozen) and 
bottom (frozen, starfield-subtracted) rows. Solar wind features are faintly visible but are 
obscured by residuum from the F coronal subtraction (see text) . 
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2008 Decl 2 14:09 



2008 Dec 16 18:09 



Uranus 




2008 Dec 21 02:09 



Uianus 



Fig. 8. — Uranus is visible in frozen starfield images from 2008 December. In these three 
panels, it is the only visible moving feature (circled and labeled), and has a central brightness 
of approximately 7 DN s^^. 



2008 Dec 12 14:09 cleaned „ 2008 Dec 16 18:09 cleaned „ 2008 Dec 21 02:09 cleaned 




X (warped pixels) X (warped pixels) X (warped pixels) 

Fig. 9. — Cleaned images, in celestial coordinates, show solar wind at the 0.02 DN s~^ level 
(full scale), with discernible features an order of magnitude fainter still. The brightness scale 
has been reduced by a factor of 1000 from Figure [H The noise level is about 1.5 x 10^^ DN 
s~^ RMS, or about 10^^ of the raw data amplitude. 
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Fig. 10. — Background-subtracted HI-2 frames, in celestial coordinates, from 2008 December, 
show solar wind "puffs" and eddies, a bright CME front, post-front voids, and other related 
structures. The images have been sampled at 6 hour cadence and are a subset of the images 
in the attached movie file in the digital edition of this article. In both the images and the 
movie, the S/N ratio of the brightest features is of order 30, and faint features down to 
3 X 10^^^ Bq are readily seen. 
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Fig. 11. — Elongation-time "J-plot" diagrams for the 11-day observation period in December 
2008 showing multiple features. Each image represents a different position angle measured 
from the ecliptic plane: 15° south (top left); 5°south (top right); 5° north (bottom left); 15° 
north (bottom right). Only the regions subject to the advanced data processing are shown. 
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Fig. 12. — STEREO /}il-2A J-plot with in-situ density data provided by the Wind space- 
craft for the same time period. These have been produced in the same manner as those in 
Figure [m We have selected the position angle to be along the ecliptic, such that the Earth 
(and Venus) are intersected. The Earth, at 70° elongation, is represented by a horizontal 
dashed line. 



